Adaptive ML-weighting in multi-band recombination of Gaussian mixture ASR

Authors

Astrid Hagen

Hervé Bourlard

Andrew C. Morris

Abstract

Multi-band speech recognition is powerful in band-limited noise, when the recognizer for the noisy band, which is less reliable, can be given less weight in the recombination process. However, it is usually hard to decide accurately which bands are reliable and which are corrupted by noise. In this article, we investigate a maximum-likelihood (ML) approach to adapting the combination weights of a multi-band system. The Gaussian mixture model parameters are kept constant, while the combination weights are iteratively updated to maximize the data likelihood. Unsupervised offline and online weight adaptation are compared with equal weights, with ‘cheating’ weights for which the noisy band is known, and with the fullband system. Initial tests indicate that both ML-weighting strategies yield a robustness gain in band-limited noise.
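The weight update described in the abstract can be illustrated with a minimal sketch. It rests on an assumption that the abstract does not confirm: that the combined likelihood of a frame is a linear mixture of per-band likelihoods, with weights that sum to one. Under that assumption, holding the per-band model parameters fixed and maximizing the data likelihood over the weights reduces to the standard EM update for mixture weights. The function name `ml_band_weights` and the toy likelihood values are hypothetical and are not taken from the paper.

```python
import numpy as np

def ml_band_weights(band_likelihoods, n_iter=50, tol=1e-8):
    """Estimate combination weights by EM while per-band models stay fixed.

    band_likelihoods: array of shape (T, B) holding the likelihood of each
    of T frames under each of B band experts (assumed precomputed from
    fixed models). Assumes the combined likelihood is sum_b w_b * p_b(x_t).
    """
    L = np.asarray(band_likelihoods, dtype=float)
    n_frames, n_bands = L.shape
    w = np.full(n_bands, 1.0 / n_bands)       # start from equal weights
    prev_ll = -np.inf
    for _ in range(n_iter):
        mix = L @ w                           # combined likelihood per frame
        ll = np.sum(np.log(mix))              # data log-likelihood
        if ll - prev_ll < tol:                # stop when the gain is negligible
            break
        prev_ll = ll
        resp = (L * w) / mix[:, None]         # E-step: band responsibilities
        w = resp.mean(axis=0)                 # M-step: new weights
    return w

# Toy data (hypothetical): band 1 is "noisy" and explains the frames poorly.
toy = np.array([[0.9, 0.1],
                [0.8, 0.2],
                [0.7, 0.1]])
weights = ml_band_weights(toy)
print(weights)
```

In this sketch the offline variant corresponds to running the loop over a whole utterance or test set. An online variant would update the weights from a running window of frames; that is likewise an assumption about how the paper's two strategies differ.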


Related resources

Spectral Entropy Feature in Multi-Stream for Robust ASR

In recent papers, entropy computed from sub-bands of the spectrum was used as a feature for automatic speech recognition. In the present paper, we further study the sub-band spectral entropy features which can give the flatness/peakiness of the sub-band spectrum and in turn the position of the formants in the spectrum. The sub-band spectral entropy features are used in hybrid hidden Markov mode...


MAP combination of multi-stream HMM or HMM/ANN experts

Automatic speech recognition (ASR) performance falls dramatically with the level of mismatch between training and test data. The human ability to recognise speech when a large proportion of frequencies are dominated by noise has inspired the “missing data” and “multi-band” approaches to noise robust ASR. “Missing data” ASR identifies low SNR spectral data in each data frame and then ignores it....


Multi-band speech recognition in noisy environments

This paper presents a new approach for multi-band based automatic speech recognition (ASR). Recent work by Bourlard and Hermansky suggests that multi-band ASR gives more accurate recognition, especially in noisy acoustic environments, by combining the likelihoods of different frequency bands. Here we evaluate this likelihood recombination (LC) approach to multi-band ASR, and propose an alternati...


Asynchrony with trained transition probabilities improves performance in multi-band speech recognition

One of the central themes in multi-band automatic speech recognition (ASR) is to devise a strategy for recombining sub-band information. This in turn raises two questions: (1) at what phonetic unit should the recombination take place? (2) How asynchronously should the sub-bands be run? Theoretically asynchronous multi-band ASR should perform at least as well as synchronous multi-band ASR. Howev...


Noise Robust Speaker Identification Using Sub-Band Weighting in Multi-Band Approach

Recently, many techniques have been proposed to improve speaker identification in noise environments. Among these techniques, we consider the feature recombination technique for the multi-band approach in noise robust speaker identification. The conventional feature recombination technique is very effective in the band-limited noise condition, but in broad-band noise condition, the conventional...




Publication date: 2001
